Stereo vision and segmentation

نویسنده

  • Andrew Blake
چکیده

I will describe models and algorithms for the real-time segmentation of foreground from background layers in stereo video sequences. Automatic separation of layers from color/contrast or from stereo alone is known to be error-prone. Here, color, contrast and stereo matching information are fused to infer layers accurately and efficiently. The stereo-match likelihood is then fused with a contrast-sensitive color model that is learned on the fly, and stereo disparities are obtained by dynamic programming. Our "Layered Graph Cut" (LGC) algorithm, does not directly solve stereo. Instead the stereo match likelihood is marginalized over disparities to evaluate foreground and background hypotheses, and then fused with a contrastsensitive color model. Segmentation is solved efficiently by graph cut optimization. In a recent development, this segmentation procedure has been used, in turn, to improve the efficiency of stereo matching, by exploiting Panum fusional bands that are well known to operate in human stereo vision. Biography Andrew Blake graduated in 1977 from Trinity College, Cambridge with a B.A. in Mathematics and Electrical Sciences. After a year as a Kennedy Scholar at MIT and two years in the defence electronics industry, he studied for a doctorate at the University of Edinburgh which was awarded in 1983. Until 1987 he was on the faculty of the department of Computer Science at the University of Edinburgh and a Royal Society Research Fellow. From 1987 to 1999, he has been on the faculty of the Department of Engineering Science in the University of Oxford, where he ran the Visual Dynamics Research Group, became a Professor in 1996, and and was a Royal Society Senior Research Fellow for 1998-9. In 1999 he moved to Microsoft Research Cambridge to lead the Vision Group. He was elected Fellow of the Royal Academy of Engineering in 1998, and Fellow of the Royal Society in 2005. In 2006 the Royal Academy of Engineering awarded him its Silver Medal. His main research activities are in computer vision. He has published several books including "Visual Reconstruction" with A.Zisserman (MIT press), "Active Vision" with A. Yuille (MIT Press) and "Active Contours" with M. Isard (Springer-Verlag). He has twice won the prize of the European Conference on Computer Vision, with R. Cipolla in 1992 and with M. Isard in 1996, and was awarded the IEEE David Marr Prize (jointly with K. Toyama) in 2001. He has served as programme chairman for the International Conference on Computer Vision in 1995 and 1999, and is on the editorial boards of the journals "Image and Vision Computing", the "International Journal of Computer Vision" and "Computer Vision and Image Understanding". Current research spans image interaction, stereo vision and motion tracking. Detailed accounts are available from publication lists, both newer papers from Microsoft Research and older papers from Oxford University. Recent research work with colleagues at Microsoft Research has been written up in the BBC's Science and Technology section and in the Guardian newspaper. 978-1-4244-1696-7/07/$25.00 ©2007 IEEE. 4

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Horizontal Disparity Estimation Algorithm Using Stereoscopic Camera Rig

83 Abstract— Image segmentation is always a challenging task in computer vision as well as in pattern recognition. Nowadays, this method has great importance in the field of stereo vision. The disparity information extracting from the binocular image pairs has essential relevance in the fields like Stereoscopic (3D) Imaging Systems, Virtual Reality and 3D Graphics. The term 'disparity' represen...

متن کامل

MAP ZDF segmentation and tracking using active stereo vision: Hand tracking case study

A maximum a posterior probability zero disparity filter (MAP ZDF) ensures coordinated stereo fixation upon an arbitrarily moving, rotating, re-configuring hand, performing marker-less pixel-wise segmentation of the hand. Active stereo fixation permits real-time foveal hand tracking and segmentation over a large visual workspace, allowing investigation of unrestricted natural human gesturing. Ha...

متن کامل

A Novel Segmentation Method for Crowded Scenes

Video surveillance is one of the most studied application in Computer Vision. We propose a novel method to identify and track people in a complex environment with stereo cameras. It uses two stereo cameras to deal with occlusions, two different background models that handle shadows and illumination changes and a new segmentation algorithm that is effective in crowded environments. The algorithm...

متن کامل

Locally Consistent ToF and Stereo Data Fusion

Depth estimation for dynamic scenes is a challenging and relevant problem in computer vision. Although this problem can be tackled by means of ToF cameras or stereo vision systems, each of the two systems alone has its own limitations. In this paper a framework for the fusion of 3D data produced by a ToF camera and a stereo vision system is proposed. Initially, depth data acquired by the ToF ca...

متن کامل

A Survey on Stereo Matching Techniques for 3D Vision in Image Processing

Extraction of three-dimensional scene from the stereo images is the most effective research area in the field of computer vision. Stereo vision constructs the actual three-dimensional scene from two stereo images having different viewpoints. Stereo matching is a correspondence problem, that means it ascertains which part of image corresponds to which part of another image ,where variations insi...

متن کامل

Self-localization of Indoor Mobile Robots Based on Artificial Landmarks and Binocular Stereo Vision

For the self-localization problem of the indoor mobile robots, a self-localization method of indoor mobile robots based on artificial landmarks and binocular stereo vision was proposed in this paper. First, a color scalable artificial landmark model is designed to give position information of the environment. Second, using the color segmentation, invariance of cross-ratio and self-adaptive wind...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007